N=1:

rank | frequency | n-gram |
---|---|---|
1 | 8593 | -i |
2 | 6956 | -a |
3 | 3442 | -n |
4 | 2508 | -r |
5 | 2001 | -g |

N=2:

rank | frequency | n-gram |
---|---|---|
1 | 2966 | -ni |
2 | 2713 | -an |
3 | 2573 | -ga |
4 | 2447 | -da |
5 | 1976 | -ng |

N=3:

rank | frequency | n-gram |
---|---|---|
1 | 1923 | -ing |
2 | 1513 | -lar |
3 | 1488 | -dan |
4 | 1228 | -ini |
5 | 1041 | -iga |

N=4:

rank | frequency | n-gram |
---|---|---|
1 | 1876 | -ning |
2 | 779 | -lari |
3 | 570 | -idan |
4 | 515 | -dagi |
5 | 466 | -arni |

N=5:

rank | frequency | n-gram |
---|---|---|
1 | 580 | -ining |
2 | 452 | -larni |
3 | 422 | -rning |
4 | 385 | -larga |
5 | 354 | -arini |

The tables show the most frequent letter N-grams at the ends of words for N=1…5. The procedure parallels 2.2.5 Most frequent word beginnings; the aim here is suffix detection rather than prefix detection.
For N=3:
SELECT @pos:=(@pos+1), xx.*
FROM (SELECT @pos:=0) r,
     (SELECT COUNT(*) AS cnt, CONCAT("-", RIGHT(word,3))
      FROM words
      WHERE w_id>100
      GROUP BY RIGHT(word,3)
      ORDER BY cnt DESC) xx
LIMIT 5;
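
The same count can be sketched outside the database for any N. The following is a minimal Python illustration, not the code used to produce the tables: it assumes the words are already available as a list with one entry per row of the words table, and the function name `top_word_endings` and the toy word list are hypothetical.

```python
from collections import Counter


def top_word_endings(words, n, limit=5):
    """Return the `limit` most frequent word-final letter n-grams
    as (rank, frequency, n-gram) tuples, for n >= 1."""
    # Like RIGHT(word, n) in the query, word[-n:] yields the whole word
    # when the word is shorter than n characters.
    counts = Counter("-" + word[-n:] for word in words)
    # most_common() sorts by descending frequency; enumerate() supplies
    # the rank that the query builds with the @pos counter.
    return [
        (rank, freq, gram)
        for rank, (gram, freq) in enumerate(counts.most_common(limit), start=1)
    ]


# Hypothetical toy word list, not taken from the corpus.
sample = ["kitobning", "uyning", "bolalar", "kitoblar", "maktabdan", "shaharning"]

for n in range(1, 6):
    print(n, top_word_endings(sample, n, limit=3))
```

Unlike the query, this sketch does not reproduce the `w_id>100` filter; any such selection would have to be applied to the word list beforehand.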